Methodology and Tools to Reconcile Data
Authors
Abstract
A data integration system (DIS) provides access to a set of heterogeneous data sources through a so-called global schema. There are basically two approaches for designing a DIS. In the global-as-view (GAV) approach, one defines the elements in the global schema as views over the sources, whereas in the local-as-view (LAV) approach, one characterizes the sources as views over the global schema. In this paper we propose methodologies to reconcile data, both for LAV and GAV. For LAV, we propose to declaratively specify reconciliation correspondences to be used to solve conflicts among data in different sources, and define an algorithm that rewrites queries posed on the global schema in terms of both the source elements and the reconciliation correspondences. For GAV, it is a common opinion that query processing is much easier than in LAV, where query processing is similar to query answering with incomplete information. However, we show that, when constraints are expressed over the global schema, the problem of incomplete information arises in GAV as well. We provide a general semantics for a GAV DIS, and specify algorithms for query answering in the presence of both incompleteness of the sources and inconsistencies between the data at the sources and the constraints on the global schema.
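The GAV idea described above can be illustrated with a minimal sketch. It is not the algorithm proposed in the paper: the source relations, the global relations `employee` and `works_in`, and the constraint are all hypothetical names chosen for illustration. The sketch shows global-schema elements defined as views over the sources, a query answered by unfolding those views, and a constraint on the global schema that the retrieved data alone does not satisfy, which is where incomplete information enters.

```python
# Hypothetical sources (assumed for illustration, not taken from the paper).
source_hr = [("alice", "sales"), ("bob", "it")]        # (name, dept)
source_payroll = [("alice", 5000), ("carol", 4200)]    # (name, salary)


# GAV mapping: each global relation is defined as a view over the sources.
def employee():
    """Global relation employee(name): names found in either source."""
    return {name for name, _ in source_hr} | {name for name, _ in source_payroll}


def works_in():
    """Global relation works_in(name, dept): taken directly from source_hr."""
    return set(source_hr)


# A query over the global schema is answered by unfolding it into the views.
def employees_with_dept():
    """Names of employees for whom some department is known."""
    known = {name for name, _ in works_in()}
    return {name for name in employee() if name in known}


# Assumed constraint on the global schema: every employee works in some
# department. Employees returned here satisfy it only if we accept that a
# department exists but is unknown, i.e. the retrieved data is incomplete.
def missing_department():
    known = {name for name, _ in works_in()}
    return employee() - known


if __name__ == "__main__":
    print(sorted(employees_with_dept()))
    print(sorted(missing_department()))
```

In this toy setting, plain unfolding answers the query from the stored tuples only, while the constraint implies that further `works_in` facts must exist for some employees even though no source supplies them.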
Similar resources
A Proposed Data Mining Methodology and its Application to Industrial Procedures
Data mining is the process of discovering correlations, patterns, trends or relationships by searching through a large amount of data stored in repositories, corporate databases, and data warehouses. Industrial procedures, with the help of engineers, managers, and other specialists, comprise a broad field and have many tools and techniques in their problem-solving arsenal. The purpose of this st...
Customer Retention Based on the Number of Purchase: A Data Mining Approach
Purpose: This study seeks a relationship between the number of purchases and the income the customer brings to the company. It aims to identify, with the help of data mining tools, customers who buy more than one life insurance policy and also show signs of good payment behavior. Design/methodology/approach: The approach of this research is to use data mining tools...
Modelling the Level of Adoption of Analytical Tools; An Implementation of Multi-Criteria Evidential Reasoning
In the future, competitive advantage will go to organisations that can extract valuable information from massive data and make better decisions. In most cases, this data comes from multiple sources. Therefore, the challenge is to aggregate them into a common framework in order to make them meaningful and useful. This paper will first review the most important multi-criteria decision analy...
Identification of the Patient Requirements Using Lean Six Sigma and Data Mining
Lean health care is one of the new management approaches that put the patient at the core of each change. Lean construction is based on visualization for understanding and prioritizing improvements. By using only visualization techniques, much important information could be missed. In order to prioritize and select improvements, it’s essential to integrate new analysis tools to achieve a good unders...
A Methodology for Product Performance Analysis under Effects of Multi-Physical Phenomena
Due to the development of science and technology, the computer has become a useful tool for supporting engineering activities in product design. Many computer aided tools such as CAD/CAM, product data management (PDM), product life cycle assessment (PLA), etc., have been popularly used in industry for reducing product development lead-time and increasing total product quality. However, the nume...